A probabilistic approach to prosodic word prediction for Mandarin Chinese TTS
Abstract
The prosodic word is a basic rhythmic unit of Mandarin Chinese speech. It is one of the most important factors determining the naturalness of the speech generated by a TTS system. This paper investigates the problem of predicting Chinese prosodic words from a word sequence. First, we examine the patterns of Chinese prosodic words and investigate the key features for prediction. Then a CART model is used as a baseline. Based on this model, the effects of the number of POS categories and the number of single-word categories are investigated. Finally, a Markov chain approach is proposed. This model combines the advantages of the CART approach and other statistical approaches while avoiding their drawbacks. Experiments show that the proposed Markov chain approach outperforms the simple CART approach.
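The abstract does not spell out how the Markov chain model is built, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a first-order chain over two boundary labels (`B` for a prosodic-word break after a word, `N` for no break), conditions on POS tags alone, uses add-one smoothing, and decodes with Viterbi. The label set, the function names `train` and `viterbi`, and the toy corpus are all hypothetical.

```python
import math
from collections import Counter

# Assumed label set: "N" = no break after the word, "B" = prosodic-word break.
LABELS = ("N", "B")

def train(corpus, alpha=1.0):
    """Estimate smoothed log-probabilities from (pos_tag, label) sentences."""
    trans, emit, tags = Counter(), Counter(), set()
    for sentence in corpus:
        prev = "<s>"  # sentence-start state
        for pos, label in sentence:
            trans[(prev, label)] += 1
            emit[(label, pos)] += 1
            tags.add(pos)
            prev = label

    def log_trans(prev, label):
        total = sum(trans[(prev, l)] for l in LABELS)
        return math.log((trans[(prev, label)] + alpha) / (total + alpha * len(LABELS)))

    def log_emit(label, pos):
        total = sum(emit[(label, t)] for t in tags)
        # One extra smoothing slot is reserved for unseen POS tags.
        return math.log((emit[(label, pos)] + alpha) / (total + alpha * (len(tags) + 1)))

    return log_trans, log_emit

def viterbi(pos_seq, log_trans, log_emit):
    """Return the most probable boundary-label sequence for a POS sequence."""
    if not pos_seq:
        return []
    scores = {l: log_trans("<s>", l) + log_emit(l, pos_seq[0]) for l in LABELS}
    back = []
    for pos in pos_seq[1:]:
        new_scores, pointers = {}, {}
        for l in LABELS:
            best_prev = max(LABELS, key=lambda p: scores[p] + log_trans(p, l))
            new_scores[l] = scores[best_prev] + log_trans(best_prev, l) + log_emit(l, pos)
            pointers[l] = best_prev
        scores = new_scores
        back.append(pointers)
    # Trace back from the best final label.
    path = [max(LABELS, key=lambda l: scores[l])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]

# Hypothetical toy data: adjective + particle + noun, with made-up labels.
toy_corpus = [[("a", "N"), ("u", "B"), ("n", "B")]] * 3
log_trans, log_emit = train(toy_corpus)
predicted = viterbi(["a", "u", "n"], log_trans, log_emit)
```

A real system would presumably use richer features than POS tags alone, such as word length, since the abstract mentions investigating key features and single-word categories.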
Similar resources
An NN-based Approach to Prosodic for Synthesizing English Words Em
In this paper, a neural network-based approach to generating proper prosodic information for spelling/reading English words embedded in background Chinese texts is discussed. It expands an existing RNN-based prosodic information generator for Mandarin TTS to an RNN-MLP scheme for Mandarin-English mixed-lingual TTS. It first treats each English word as a Chinese word and uses the RNN, trained fo...
Decision Tree based Duration Prediction in Mandarin TTS System
This paper reports the methodology and results of decision tree based duration prediction for a Mandarin text-to-speech system developed by the Fujitsu Laboratories. Syllable initials and finals are the basic units in this duration study. Factors influencing finals duration such as phrase boundary and phone context are discussed in detail. Experiments indicate that it is the most important dete...
Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification
Prosodic boundary prediction is the key to improving the intelligibility and naturalness of synthetic speech for a TTS system. This paper investigated the problem of automatic segmentation of prosodic word and prosodic phrase, which are two fundamental layers in the hierarchical prosodic structure of Mandarin Chinese. Maximum Entropy (ME) Model was used at the front end for both prosodic word a...
Prosodic Word Grouping in Mandarin TTS System
This paper reports the methodology and results of prosodic word grouping for a Mandarin TTS system developed by the Fujitsu Laboratories. Since any break inside a prosodic word makes speech unintelligible or unnatural, a new prosodic word grouping framework is proposed. The word segmentation result can be regarded as an initial prosodic word sequence with grids inserted into each word bou...
Publication date: 2005